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f Results  from  500  replications  of  each  of  six  cells  showed  that  OLS  and  jackknife 
estimators  did  not  differ  appreciably  on  either  criterion. 
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This  study  reports  the  results  of  a  Monte  Carlo  evaluation  of  the 
small  sample  performance  of  ordinary  least  squares  (OLS),  ridge,  and 
Jackknife  estimators  of  regression  coefficients.  The  primary  criteria  of 
evaluation  are  the  mean  square  error  (MSE)  of  the  regression  coefficients 
and  the  size  of  the  ^-statistics  associated  with  these  coefficients.  The 
conditions  studied  are  derived  from  a  design  In  which  two  factors  are 
varied:  the  sample  size  to  number  of  predictors  (^/g_)  ratio  and  the 
metric  quality  of  the  data  (dichotomous  and  polychotomous) . 

Results  from  500  replications  of  each  of  six  cells  showed  that  OLS 
and  Jackknife  estimators  did  not  differ  appreciably  on  either  criterion. 

Ridge  weights  had  substantially  smoller  MSE  at  the  lowest  levels  of 
N/£  (5:1).  At  high  levels  of  N/£  (20:1),  ridge  weights  had  a  slightly 
greater  MSE  than  either  OLS  and  Jackknifed  weights.  A  dilemma  In  the  use 
of  ^-statistics  for  ridge  weights  Is  presented. 
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MuUlcol  1  inearlty  produces  many  problems  in  the  analysis  of  data  by 
multiple  regression.  One  of  the  most  common  of  these  problems  is  the 
"bizarre"  regression  coefficients  that  are  sometimes  found.  In  some 
cases  one  or  more  of  the  standardized  coefficients  is  greater  than  unity 
in  absolute  value.  This  is  particularly  unsettling  when  two  items 
measuring  the  same  underlying  factor  and  having  approximately  equal 
correlations  with  the  dependent  variable  are  entered  into  an  equation  and 
one  of  the  coefficients  gets  the  "wrong  sign."  That  is,  the  sign  of  the 
regression  weight  is  the  opposite  of  the  sign  of  the  correlation  between 
that  variable  and  the  dependent  variable. 

A  wide  range  of  techniques  have  been  developed  to  deal  with  problems 
such  as  this  (see  Dempster,  Schatzoff  and  Wermuth,  1977,  for  a  study  of 
57  varieties  of  estimators).  Many  of  these,  including  variable  selection 
and  ridge  regression,  can  be  used  to  "shrink"  the  regression  weights  by 
ignoring  certain  aspects  or  reducing  the  influence  of  certain  parts  of 
the  data  matrix.  The  differences  among  the  procedures  arise  from  the 
ways  in  which  they  accomplish  this  function.  Variable  selection 
techniques,  the  most  commonly  recommended  solution,  reduce  the  effects  of 
mul ticol 1 inearity  by  deleting  variables.  Stepwise  regression  algorithms, 
for  example,  typically  admit  additional  variables  into  a  regression 
equation  only  if  the  multiple  correlation  of  the  additional  variable  with 
the  already  entered  variables  is  acceptably  low.  This  eliminates  the 
problem  of  variables  with  "wrong  signs"  by  deliberately  causing  the 
"wrong  signs"  to  be  replaced  by  zeros.  That  is,  the  equation  associated 
with  the  simultaneous  analysis  of  all  of  the  variables, 

[11  Y  -  B^xj  ♦  Bjx^  ♦  ...  ♦  flpXp  ♦  e 


where  is  the  dependent  measure,  the  are  the  regression  weights  for 
the  full  set  of  £  predictor  variables,  x^;  and  e  Is  the  error  of 
prediction.  Is  replaced  by  a  restricted  regression  equation, 

[2]  Y  -  BiXj  ♦  62*2  ♦  ...  ♦  Bj,x,^  (♦  OXjj^+j  +  ...  +  OXp)  ♦  e 

where  the  zeros  before  the  variables  within  the  parentheses  denote  that 
by  deleting  these  variables,  they  have  Implicitly  received  a  weight  of 
zero  In  the  regression  equation. 

A  recently  developed  alternative  to  variable  selection  Is  ridge 
regression  (Hoerl  and  Kennard,  1970,  a,  b).  Although  stepwise  regression 
differs  from  ordinary  least  squares  (OLS)  In  the  way  that  the  equations 
are  specified,  the  difference  between  ridge  regression  and  OLS  is  In  the 
computation  of  the  regression  weights.  OLS  estimates  of  the  regression 
weights  are  derived  from  the  formula, 

[3]  s  -  R-lr 

where  £  is  a  vector  of  standardized  regression  weights,  R  Is  the  matrix 
of  correlations  among  the  predictor  variables  and  £  is  the  vector  of 
predictor-criterion  correlations.  Ridge  coefficients  are  estimated  from 
the  equation, 

[4]  B*  -  (R  +  kl)-lr 

where  £  Is  an  arbitrary  constant  and  £  Is  an  identity  matrix.  In  ridge 
regression,  the  elements  on  the  diagonal  of  the  predictor  £  matrix  are 
Inflated  so  that  they  are  large  compared  to  the  off-diagonal  elements. 

As  the  size  of  the  constant  added  to  the  diagonal  Increases,  the 
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resulting  estimates  of  the  regression  coefficients  are  "shrunk"  toward 
zero.  This  shrinkage  produces  an  estimator  which  is  biased,  but  has  less 
variance  than  the  OLS  estimator.  The  amount  of  bias  Increases  with  the 
magnitude  of  the  constant,  k^. 

Although  the  proper  method  of  determining  the  value  of  the  constant, 
Ic,  has  not  been  fully  resolved,  the  subjective  rules  proposed  by  Hoerl 
and  Kennard  (1970,  a,  b;  see  also  Price,  1977),  have  been  supplanted  by 
objective  rules.  Hoerl,  Kennard  and  Baldwin  (1975),  for  example,  have 
recotnnended  the  use  of  the  value, 

[5]  k  ■  pa^/ZBi^ 

2 

where  £  is  the  number  of  predictor  variables,  £  is  the  error  variance 
and  the  are,  as  before,  the  standardized  regression  coefficients. 

In  addition  to  concerns  for  reasonable  looking  regression  weights, 
researchers  are  often  interested  in  determining  the  statistical 
significance  of  the  coefficients.  This  often  involves  the  computation  of 
standard  errors  and  £-stati sties  for  the  individual  coefficients.  When 
high  levels  of  multicol linearity  are  present,  these  statistics  are  also 
affected.  Multicollinear'ity  produces  standard  errors  which  are  quite 
large  and  ^-statistics  which  are  small.  This  problem  has  caused  some 
statisticians  (see,  for  example.  Press,  1975)  to  recomnend  that  highly 
correlated  variables  be  deleted  and  the  regression  equation  reestimated. 
Note  that  this  solution  to  the  problem  Is  an  informal  algorithm  for 
backward  deletion  (Draper  and  Smith,  1966)  which  Is  another  type  of 
variable  selection. 
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A  ready  tool  which  might  be  compared  to  OLS  estimates  of  standard 
errors  and  ^-statistics  Is  the  jackknife  (Hosteller  and  Tukey,  1968). 

The  Jackknife  procedure  Is  a  simple  and  widely  applicable  technique  for 
computing  robust  estimates  of  many  statistics  and  their  associated 
standard  errors.  Its  greatest  value  lies  In  situations  In  which  there 
are  no  readily  available  techniques  for  calculating  the  standard  error  of 
a  coefficient  or  when  the  analyst  has  reason  to  believe  that  conventional 
estimates  of  the  standard  error  are  Inappropriate. 

The  key  to  the  jackknife  Is  the  simple  device  of  dividing  a  body  of 
data  Into  subsamples,  creating  "pseudovalues”  by  computing  a  weighted 
difference  between  the  coefficients  produced  by  analysis  of  the  entire 
sample  and  the  coefficients  produced  by  analysis  of  each  of  the 
subsamples,  and  using  the  mean  and  standard  deviation  of  the  pseudovalues 
to  generate  estimates  of  the  parameter  and  Its  standard  error. 

The  first  step  of  the  jackknife  requires  that  the  body  of  data  be 
divided  Into  n  subgroups  of  (approximately)  equal  size.  Subsamples  are 
formed  by  deleting  each  of  the  subgroups.  In  turn,  from  the  body  of 
data.  Subsample  1  Includes  all  but  the  first  N/n  observations, 
subsample  2,  all  but  the  next  subgroup  of  N/n  observations,  and  so  on. 

In  step  two,  a  regression  analysis  Is  performed  on  the  total  sample  and 
on  each  of  the  n  subsamples.  In  step  three  these  matrices  of 
coefficients  are  converted  to  pseudovalues  as  computed  by  the  following 
equation: 


[6]  e(j)  -  (n  -  1) 
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where  Is  the  j[  vector  of  pseudovalues,  n  is  the  number  of 

subgroups,  b_  is  the  vector  of  regression  coefficients  derived  from  the 

analysis  of  the  entire  body  of  data,  and  is  the  coefficient  vector 

th  ^ 

derived  from  the  deletion  of  the  subsample.  Step  four  reduces  the 
obtained  pseudovalues  to  means  and  standard  deviations.  The  vector  of 


means  of  the  pseudovalues  is  the  jackknifed  estimate  of  the  regression 
coefficient  vector  (e^)  and  each  element  of  the  vector  of  standard 
deviation  of  the  pseudovalues  divided  by  the  square  root  of  £  is  an 
estimate  of  the  standard  error  of  the  corresponding  regression 
coefficient. 


The  purpose  of  the  experiment  reported  in  this  paper  was  to  compare 
OLS,  ridge  and  jackknife  estimates  of  regression  coefficients  under  some 
conditions  which  reasonably  approximate  those  existing  in  many  situations 
of  interest  to  psychologists.  Two  characteristics  of  data  that  are 
particularly  important  to  study  are  the  number  of  observations  relative 
to  the  number  of  predictors  (N/£  ratio)  and  the  violation  of  the 
assumptions  of  the  regression  model.  The  ]</£  ratio  should  be  considered 
since  low  levels  (N/££  5)  produce  unstable  estimates  of  regression 
weights  which  are  very  likely  to  be  of  the  "bizarre"  type  discussed 
earlier.  Violation  of  standard  regression  assumptions  should  be 
considered  since  failure  of  the  errors  to  conform  to  the  pattern, 

E(cc')  ■  o  I,  can  adversely  affect  the  estimates  of  the  standard  errors 
of  the  regression  coefficients. 
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METHOD 

Violation  of  regression  assumptions  was  produced  by  the  use  of 

discrete  (dichotomous  or  polychotomous)  rather  than  continuous  scales  of 

measurement  on  the  dependent  variable.  The  analysis  of  discrete 

dependent  variables  by  OLS  is,  of  course,  strongly  discouraged  since,  in 

2 

this  case,  E(ec')  ^  a  1.  This  problem  is  particularly  severe  with 
dichotomous  variables.  Thus,  the  use  of  two  different  levels  of  metric 
quality  provides  a  convenient  means  of  assessing  the  sensitivity  of  OLS, 
ridge  and  jackknife  to  different  levels  of  violation  of  regression 
assumptions.  Three  levels  of  ]</£  and  two  levels  of  metric  quality 
(dichotomous  and  polychotomous)  were  used  to  form  a  factorial  design. 

Coflinon  to  the  resulting  set  of  six  conditions  was  a  population 
predictor  matrix,  taken  from  Price  (1977),  shown  in  Table  1.  The  five 
predictor  variables  are  highly  col  linear,  as  seen  by  the  ratio  of  the 
largest  eigenvalue  to  the  smallest  eigenvalue  of  the  correlations  matrix 
(Xj/Xg  -  796). 


Insert  Table  1  About  Here 


Each  of  the  replications  of  each  of  the  six  cells  of  the  design 

matrix  required  the  estimation  of  twelve  regression  equations — one  OLS, 

one  ridge  and  ten  jackknife  equations.  In  view  of  this  substantial 

amount  of  computing,  the  study  was  limited  to  a  single  population 

regression  equation.  An  equal  weighted  equation  (£^  -  .226,  1  ■  1,5) 

2  2 

with  an  associated  •  1  -  R  .  .20,  was  chosen  as  a  case  which 
was  particularly  likely  to  challenge  the  utility  of  the  Hoerl,  et  al. 
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(1975)  bias  coefficient  estimator,  i^.  It  can  easily  be  seen  that  for 
this  population  regression  equation,  the  values  of  1^  estimated  from 
equation  [5]  will  equal  (in  the  population),  (5)(.2)/.354  «  2.826.  Since 
estimated  values  of  will  consistently  be  greater  than  unity  in  the 
sample  equations,  equal  weighted  predictors  will  produce  severely  biased 
estimates  of  the  . 

The  use  of  five  predictor  variables  and  three  levels  of  ^/£  (5/1, 
10/1  and  20/1)  produced  sample  sizes  of  25,  50  and  100.  These  can  be 
considered  to  be  marginally  acceptable  (5/1),  recommended  (10/1),  and 
highly  desirable  (20/1)  ratios  of  ^/£,  The  metric  quality  of  the 
predictor  variables  was  held  constant  (at  five  levels)  across 
conditions.  The  dependent  (criterion)  variable  set  at  either  five  levels 
(polychotomous)  or  two  levels  (dichotomous).  Results  for  each  of  the  six 
combinations  of  the  two  factors  were  based  upon  500  replications  of  each 
of  the  three  estimation  techniques.  That  is,  in  each  cell  of  the  design 
matrix,  500  separate  samples  were  generated. 

The  values  of  the  predictor  variables  in  each  sample  (the  X  matrix) 
were  computed  by  multiplying  a  matrix  of  independent  standard  (zero  mean 
and  unit  variance)  normally  distributed  pseudo-random  numbers  by  the 
matrix  of  eigenvectors  of  tho  predictor  R  matrix.  These  random  numbers 
were  generated  by  subroutine  GGNOR  (IMSL,  1977).  The  value  of  the 
criterion  variable  was  computed  by  post-multiplication  of  the  sample  X 
matrix  by  the  vector  of  standardized  population  regression  coefficients 
and  addition  of  an  independent  normally  distributed  pseudo-random  error 
term  (N  (0,  /77)).  Predictors  and  criterion  were  further  transformed  to 
the  appropriate  p-chotomous  (p  •  5,2)  variables  by  dividing  the  interval 


8 


between  -3  and  +3  standard  deviations  about  the  mean,  -3.0  <,*£■*■  3.0, 
into  £  equal  intervals.  The  discrete  transformation  process  has  a 
differential  effect  upon  the  moments  of  the  sample  distributions  of  the 
dichotomous  and  polychotomous  dependent  variables.  As  a  result,  "main 
effects"  of  metric  quality  are  not  only  uninteresting  (as  they  would  be 
in  any  event)  but  ambiguous,  as  well.  The  nature  of  discrete 
transformation  does  not,  however,  affect  the  interpretation  of 
interaction  effects — such  as  method  of  analysis  by  metric  quality  or 
method  of  analysis  by  metric  quality  by  sample  size — which  are  discussed 
below. 

Following  generation  of  the  sample  values,  the  three  regression 
procedures  were  executed.  The  standardized  OLS  coefficients  were 
computed  through  solution  of  equation  [3].  Standard  errors  of  the 
regression  coefficients  were  computed  and  ^-ratios  were  formed.  Ridge 
coefficients  were  computed  through  the  use  of  equation  [4]  after  the 
value  of  had  been  estimated  from  the  expression  proposed  by  Hoerl, 
Kennard  and  Baldwin,  contained  in  equation  [5].  Standard  errors  of  the 
ridge  coefficients  were  derived  from  the  square  roots  of  diagonal 
elements  of  the  matrix 

[7]  V(b*)  -  (j2(R  kl)-^  R  (R  >  kl)“l 

as  shown  by  Marquart  and  Snee  (1975,  p.  6).  Jackknifed  regression 
coefficients  and  their  standard  errors  were  computed  from  the  expressions 

[3]  8*i  -  /  n 

[9]  VAR  (a*i)  -  /  (n  -  1) 


At  the  end  of  the  500  replications,  the  accumulated  data  were  converted 
to  the  statistics  (a)  mean  squared  error  of  the  regression  weights, 

(b)  mean  bias,  (c)  mean  standard  error  of  the  regression  weight,  and 
(d)  mean  ^-statistic.  These  criteria  are  defined  by  the  expressions  in 
Table  2. 


Insert  Table  2  About  Here 


RESULTS 

The  data  contained  in  Table  3  show  that  the  MSE  of  OLS  decreases 
with  increasing  sample  size.  This  is  as  one  would  expect  since  MSE  is 
composed  of  two  components,  bias  and  variance.  Since  OLS  is  unbiased  and 
variance  decreases  with  sample  size,  one  should  find  this  result.  The 
pattern  is  repeated  at  both  levels  of  metric  quality  although — again  as 
one  might  expect — MSE  is  much  higher  for  a  dichotomous  criterion  variable 
than  for  a  polychotomous  criterion.  The  obtained  levels  of  MSE  for  the 
jackknife  coefficients  are  almost  identical  to  those  for  OLS. 


Insert  Table  3  About  Here 


Just  the  reverse  pattern  holds  for  the  ridge  estimator;  the  MSE 
increases  with  sample  size.  This  interaction  is  a  "crossover"  or 
disordinal  interaction.  At  5/1  and  10/1,  ridge  has  smaller  MSE  than  OLS, 
but  at  20/1,  OLS  has  smaller  MSE  than  ridge.  The  effect  is  even  more 
pronounced  for  a  dichotomous  criterion  than  for  a  polychotomous 
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criterion.  Ridge  is  much  less  sensitive  to  metric  quality  at  small 
sample  sizes  (N  *  25)  than  are  OLS  or  jackknife. 

Inspection  of  Table  4  suggests  an  explanation  for  this  result. 
Although  all  OLS  and  jackknife  estimates  are  slightly  biased  due  to  the 
attenuation  caused  by  the  discrete  transformation,  ridge  coefficients  are 
substantially  biased.  Moreover,  this  bias  increases  with  increasing 
sample  size. 


Insert  Table  4  About  Here 


Mean  standard  errors  and  mean  _t-statistics  associated  with  each  of 
the  three  estimation  methods  are  displayed  in  Tables  5  and  6.  Again, 
there  are  no  appreciable  differences  between  OLS  and  jackknifed 
estimates.  The  important  contrast  is  between  the  estimated  standard 
errors  for  ridge  coefficients  and  those  of  OLS  and  jackknifed 
coefficients.  The  estimated  standard  errors  of  the  ridge  coefficients 
are  not  only  consistently  smaller,  but  the  ratio  ^  (b)  /  ^  (b*)  gets 
larger  with  increasing  sample  size.  Metric  quality  affects  all  three 
estimators.  The  dichotomous  criterion  produces  higher  estimated  standard 
errors  and  lower  t-statistics  for  all  three  estimators. 


Insert  Tables  5  and  6  About  Here 


DISCUSSION 


The  use  of  jackknife  estimation  is,  under  conditions  like  those 
studied  here,  clearly  not  warranted.  Although  jackknifed  estimates  of 
the  are  virtually  Identical  to  the  OLS  estimates,  jackknifed 
coefficients  require  substantially  more  computational  effort.  The 
equivalence  of  jackknifed  regression  weights  and  OLS  weights  is  no  great 
surprise.  As  Wainer  and  Thissen  (1977)  have  noted,  studies  of  robust 
statistics  have  emphasized  their  usefulness  in  the  presence  of 
distributions  which  have  longer  tails  than  those  of  the  normal 
distribution.  The  results  obtained  here  are  consistent  with  their  remark 
that  OLS  will  do  quite  well  (compared  to  robust  methods  such  as  the 
jackknife)  with  short  tailed  distributions. 

Since  the  estimated  standard  errors  of  the  OLS  and  jackknifed 
coefficients  are  also  the  same,  the  mean  ^-ratios  are  virtually 
identical.  This  is  a  substantial  factor  working  against  the  jackknife. 

In  the  case  of  the  ^-statistics,  there  is  not  only  the  additional 
computational  burden,  but  the  jackknifed  ^-statistic  is  based  upon  fewer 
degrees  of  freedom  (^  -  1  for  the  jackknife  vs.  N  -  £  -  for  OLS).  The 
difference  is  substantial  even  at  the  lowest  N/£  ratios  studied  here. 

The  superiority  of  the  Hoerl,  Kennard  and  Baldwin  (HKB)  ridge 
estimator  over  OLS  at  low  levels  of  N/£  raises  some  interesting 
questions.  Although  the  bias  coefficients  provided  by  equation  [51  are 
intended  to  produce  estimates  which  have  smaller  expected  mean  square 
error  than  OLS,  they  do  not  necessarily  produce  minimum  mean  squared 
error.  The  extreme  level  of  bias  produced  in  this  situation  leads  one  to 
wonder  whether  a  less  severely  biased  estimator  might  prove  to  be 
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superior  to  both  OLS  and  the  HKB  estimator.  In  support  of  the  HKB 

estimator.  It  should  be  noted  that  It  can  produce  "reasonable  looking" 

values  for  k^.  For  example,  when  applied  to  the  Price  (1977)  data. 

Including  the  Initial  OLS  estimates  of  the  obtained  by  that  author, 

equation  [5]  produces  an  estimate  of  that  Is  virtually  Identical  to  the 

value  that  was  derived  subjectively  from  Inspection  of  the  ridge  trace. 

Although  the  "true"  model  underlying  the  Price  data  Is  unknown,  the 

Initial  OLS  estimates  of  the  regression  coefficients  varied  considerably 

2 

in  magnitude.  This,  In  turn,  produced  a  large  which  resulted 

in  a  small  (1^  -  .06)  value  of  the  bias  coefficient.  Whether  or  not  the 
HKB  estimator  can  be  Improved  upon,  the  fact  the  choice  between  OLS  and 
ridge  regression  Is  affected  by  the  N/£  ratio  suggests  that  the 
Importance  of  this  factor,  hitherto  Ignored  In  Monte  Carlo  studies  of 
ridge  regression,  ought  to  be  noted. 

This  conclusion  may  appear  to  be  inconsistent  with  that  of  McDonald 
and  Galarneau  who  stated  that  the  performance  of  ridge  estimators 
"depends  on  the  variance  of  the  random  error,  the  correlations  among  the 
explanatory  variables  and  the  unknown  coefficients  vector"  (1975, 
p.  416).  However,  the  N/£  ratio  and  the  metric  quality  of  the  variables 
both  act  together  with  the  true  error  variance  to  Influence  the  estimated 
variance  of  the  random  error.  They  also  act  together  with  the 
correlations  among  the  explanatory  variables  and  the  unknown  coefficient 
vector  to  Influence  the  OLS  estimates  of  the  regression  coefficients. 
Since  both  the  estimated  variance  of  the  random  error  and  the  estimated 
OLS  regression  coefficient  vector  are  used  to  calculate  the  proper  amount 
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of  bias,  1^,  to  Introduce  Into  the  ridge  estimator,  there  Is  no  real 
conflict  between  the  results  of  this  study  and  the  conclusions  drawn  from 
prior  studies  of  ridge  regression. 

That  the  standard  errors  of  these  ridge  coefficients  are 
substantially  smaller  than  the  corresponding  OLS  estimates  and  the 
Jt-statlstics  are  correspondingly  larger  is  also  an  Interesting  finding. 

To  date,  there  has  been  no  Indication  that  these  t-statlstics  are 
actually  distributed  as  Student's  and  if  so,  with  what  number  of 
degrees  of  freedom.  The  data  shown  in  Table  6  suggest  that  an 
investigator  who  used  the  ridge  estimates  of  the  ^-statistics  as  a  rough 
indicator  of  statistical  significance  would  be  quite  likely  to  reject  the 
null  hypothesis,  •  0.  Such  a  procedure  would,  however,  also  lead 
one  to  reject  a  specific  alternative  hypothesis  of  ■  .226,  the 
population  regression  coefficients  for  this  data  set.  To  argue  that 
specific  alternative  hypotheses  are  virtually  nonexistent,  though  true, 
seems  to  beg  the  issue.  A  more  satisfactory  resolution  to  the  problem 
may  be  to  assert  that  bias  in  regression  weights  is  a  relatively 
unimportant  consideration.  What  does  make  a  difference  to  many,  if  not 
most  Investigators,  Is  that  the  ratios  of  the  weights  be  accurately 
determined.  Put  otherwise,  one  wants  the  orientation  of  the  obtained 
weight  vector  in  the  parameter  space  to  be  close  to  that  of  the  true 
regression  vector.  Estimates  which  are  proportional  to  the  true  weights 
would,  by  this  criterion,  be  equivalent.  It  seems  reasonable  that 
shrunken  estimators  such  as  ridge  would  be  more  likely  than  OLS  to 
satisfy  this  proportionality  criterion.  The  nature  of  the  types  of 
conditions  under  which  they  would  do  so  remains  to  be  determined. 


FOOTNOTES 


Support  for  this  project  was  provided  by  Office  of  Naval  Research 
contract  N00014-77-C-0127;  E.  E.  Lawler,  III,  Principal  Investigator. 
Opinions  expressed  are  those  of  the  author  and  are  not  to  be  considered 
as  necessarily  reflecting  the  official  view  or  endorsement  of  the 
Department  of  the  Navy.  Requests  for  reprints  should  be  sent  to  the 
author  at  Battel le  Human  Affairs  Research  Centers,  P.O.  Box  C-5395, 
Seattle,  Washington,  98105. 
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TABLE  2 


Criterion 

Expression 

mo 

500 

5 

A  0 

Mean  squared  error 

z 

t=l 

Z 

1=1 

(S„  - 

1 

500 

5 

Bias 

z 

Z 

1-1 

(e,(  ■  l„) 

mo 

500 

5 

Mean  standard  error 

z 

t=l 

Z 

1=1 

SE  (B,j) 

1 

?5oo 

500 

5 

Mean  t-statlstic 

z 

Z 

1=1 

tg 

TABLE  3 

Mean  Squared  Error  of  Coefficients 

Metric  Quality  of  Criterion 

Dichotomous  Polychotomous  (5) 

Sample  -  - 


Size 

OLS 

Ridge 

Jackknife 

OLS 

Ridge 

Jackknife 

25 

.607 

.111 

.638 

.333 

.080 

.354 

50 

.262 

.131 

.266 

.135 

.092 

.139 

100 

.125 

.149 

.125 

.066 

.101 

.067 

TABLE  4 


Bias  of  Coefficients 


Metric  Quality  of  Criterion 


Polychotomous  (5) 


Dichotomous 


Jackknife 


Jackknife 


TABLE  5 

Mean  Standard  Error  of  Coefficients 

Metric  Quality  of  Criterion 

Dichotomous  Polychotomous  (5) 

Sample  -  - 


Size 

OLS 

Ridge 

Jackknife 

OLS 

Ridge 

Jackknife 

25 

.309 

.052 

.351 

.239 

.054 

.271 

50 

.204 

.023 

.214 

.157 

.027 

.131 

100 

.140 

.012 

.142 

.107 

.017 

.109 

TABLE  6 

Mean  t-StatistIc  for  Coefficients 

Metric  Quality  of  Criterion 
Dichotomous  Polychotomous  (5) 


Sample 


Size 

OLS 

Ridge 

Jackknife 

OLS 

Ridge 

Jackknife 

25 

.736 

1.802 

.702 

1.081 

2.148 

1.030 

50 

1.086 

2.997 

1.105 

1.594 

3.481 

1.631 

100 

1.561 

4.577 

1.647 

2.308 

5.234 

2.389 
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2450  Massachusetts  Avenue,  N.W. 
Washington,  DC  20008 
ATTN:  CDRD 

Mr.  Luigi  Petrullo 

2431  North  Edgewood  Street 

Arlington,  VA  22207 

Dr.  Eugene  F.  Stone 
Assistant  Professor  of  Administrative 
Sciences 

Krannert  Graduate  School 
Purdue  University 
West  Lafayette,  IN  47907 

Mr.  Mark  T.  Hunger 
McBer  and  Company 
137  Newbury  Street 
Boston,  MA  02116 

Commandant 

Royal  Military  College  of  Canada 
Kingston,  Ontario 
K7L  2W3 

ATTN:  Department  of  Military 

Leadership  and  Management 

Dr.  Richard  T.  Mowday 
Graduate  School  of  Management 
and  Business 
University  of  Oregon 
Eugene,  OR  97403 

National  Defense  Headquarters 
Ottawa,  Ontario 
KIA  0K2 
AHN:  DPAR 
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CDR  William  A.  Earner 
Management  Department 
Naval  War  College 
Newport.  RI  02940 


Mr.  Martin  Mllrod 

Educational  Equity  Grants  Program 

1200  19th  Street,  N.W. 

National  Institute  of  Education 
Washington,  DC  20208 

CAPT  Richard  L.  Martin,  USN 

Commanding  Officer 

USS  Francis  Marlon  (LPA-Z49) 

FPO  New  York  09501 


ATTN:  Library 

ARI  Field  Unit  -  USAREUR 

c/o  DCSPER 

APO  New  York  09403 


MAJ  Robert  Mil  trout 

Mr.  Richard  Grann 

U.S.  Army  Trimis-Evaluatlon  Unit 

Walter  Reed  Army  Medical  Center 

Washington,  O.C.  20012 


Mr.  Thomas  N.  Martin 
Department  of  Administrative  Sciences 
College  of  Business  and  Administration 
Southern  Illinois  University 
Carbondale,  IL  62901 


